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ABSTRACT 

This essay provides a policy analysis and literature 
review of selected issues in teacher quality and selection . The first 
section discusses developments in: (1) teacher quality? (2) quality 
and selection of minority group teachers; (3) the validity of the 
National Teacher Examination (NTE); (4) state certification; (5) 
local eligibility requirements; (6) competeticy-based teacher 
education and certification? (7) competency testing; and (8) local 
efforts to reform teacher selection criteria. In the second section 
of the essay, studied that pertain to the relationships among teacher 
presage variables^ and desirable student outcomes are discussed. 
Teacher presage variables are defined as characteristics that 
teachers possess before they enter the classroom, including NTE 
scores, college grades, years of experience and certification, race, 
and personality traits. A conclusion summarizes findings in these 
list of 75 references is appended. (FG> 
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James J. Shields, Jr. and Richard Daniele 
This section is divided into two Darts. The first deals 
with a policy analysis of the* public controversy over the aca- 
demic competence of teachers, government efforts to mandate mere 
eouality in the hiring of minority teachers and the problem of 
finding appropriate job-related instruments for screening teacher 
applicants. This is accompanied by a description of state certi- 
fication and local eligibility reaui rement s and an analysis of 
recent trends on the cutting edge of educational reform. Parti- 
cular attention is focused on the state competency testing move- 
ment for teacher certification. 

The second part is a review of research studies relevant 
to the teacher selection process on the role of presage, process 
and product variables in predicting ..eacLer effectiveness. Spe- 
cifically, such presage variables as NTE scores, collepe grades, 
certification and years of experiences, race and personality 
factors, are discussed in terms of their validity in predicting 
changes in student achievement and behavior and evaluations of 
teacher performance . 

\£ The sources for the review were secured trough a computer 

Or, 

C search, conducted by the New York State Library, of literature 
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C This section, with some modifications, will appear in the. 

Encyclopedia of Educational Research, , 5th ed., 1982. 
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AN EQUAL OPPORTUNITY EMPLOYER ^ 



on teacher selection in ERIC and Psychological Abstracts . A second 
ahd more intensive search of ERIC, under the sponsorship of the 
American Association of Colleges of Teacher Education, provided 
more material, in particular on the National Teachers Examina- 
tion (NTE). Also, because the last ten years have been a period 
of historic involvement by the states and the federal government 
in teacher selection issues, newspapers such as the New York 
Times and government and educational association reports were 
explored. 

Policy Analysis 

The Quality Issue 

Charges have been made nationally that the quality of 
beginning teachers in terms of academic competence is declining. 
For instance, in New Mexico it was reported that none of the 
state* s 136 bilingual teachers could pass a fourth-grade-level 
Spanish examination (Crewdson, 1979). The Dallas Independent 
School District, the nation* s eighth largest urban school dis- 
trict, disclosed that more than half of its 535 new teachers 
in 1976-1977 had failed a competency test that is used to 
test the intelligence of those thirteen years and older. A 
copy of test scones obtained by the Dallas t Times Herald showed 
that administrators who took the test did even worse than a 
sample of high school students at a private school in North 
Dallas. Of the 535 teachers tested, eleven correctly answered 
ten or fewer of the 60 questions (Disputes embarrass Dallas's 
educators, 1978). 

Although there are no comprehensive data on the academic 
comp^ence of new teachers, there are statistics on education 
students as compared to noneducation students. The College 
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Board's I98fo report of Scholastic Aptitude Test scores for college- 
bound high \chool seniors showed that the* 6.1 per cent of students 
who s^id they planned to study education had some of the lowest 
scores on the test, Potential education majors had ar^ average 
verbal score of 389 and an average math .score of 418, compared 
vith the national average of 424 on the verbal section ajnd 466 
on the math section for students in all fields (Watkins, 1981) • 

Weaver (1978, 1979) reported that the scores of education 
majors on the Graduate Record Examination has declined signifi- 
cantly since 1970 and are substantially lower than those of 
majors in eight other professional f ields ^cogipared in 1975 and' 
1976 • Carnegie Commission statistics compiled on undergraduate 
teacher trainees go beyond Weaver/s findings. They suggest 
that not only are the best students, when the best implies 
"the most intellectual," not being attracted to teaching, but 
neither are the best when the definition of best is changed to 
"the least authoritarian," "the most attuned to cultural plural- 
ism," and "the most flexible in attitudes toward minority cul- 
tures" (Study Commission on Undergraduate Education and the 
Education of Teachers, 1975). 

Haberman (1972), in a survey of 386 member institutions 
of the National Council for the Accreditation of Teacher Educa- 
tion (NCATE), found that 344 of them rely heavily upon minimum 
cumulative grade point averages and accept almost anyone who 
applies (Twa & Greene, 1980). These studies generally support 
the widely held view in the academic community that teacher 
training programs admit almost anyone who meets minimum entrance % 
requirements. 
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The Equity Issue —j— ^ 

The other problem is one of equity, of whether existing 
teacher selection practices restrict the employment opportuni- 
ties of minorities ^nd/or women disproportionately compared to 
others. In 1970, the teacher selection process in New York City 
public schools was the subject of extensive hearings convened - 
because many believed that minorities, in effect, were being 
systematically excluded from teaching. At the hearings, the 
President of the Board of Education testified €hat the emphasis 
on formalistic requirements in the selection process was one ^ 

i . * . 

of the reasons there were so few Puerto Rican professionals in ^ 
Ithe New York schools (New York City Commission on Human Rights, v 
1971). 

In 1974-1975, a comparison of the racial composition of 
the teaching staffs within the five major public .schools sys- 
terns ^indicated that the percentage of minority teachers in 
the Los Angeles system .was 31.4 par cent; in Chicago, 43,2 
per cent; in Philadelphia, 40.2 per cent; and in- Detroit, ^ 
50.6 per k cent. In New York City, it was 13.2 per cent for a 
65.2 per cent student minority group population (Racial compo- 
sition: five school districts, 1990). 

Charges tfiSSt hiring practices in the New York City public 
^schools 2fn several respects were discriminatory, in effect, if 

. . * . . v . ' . • 

not in intent, resulted in a compliance agreement signed m 
1977 with the U.S. Office of Civil Rights to change the method 
by which teachers were selected and assigned * In part, the 
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need for the agreement arose out of a state decentralization 
law that permitted low achieving school districts to hire their 
own teachers. The law, designed primarily to increase the num- 
ber of minority group teachers, had the effect of creating a 
virtually segregated teaching staff* 

U.S. District • Court Judge Jack B, Weinstein upheld the 
agreement in 1979 and ruled that each district had to reflect, 
within a range of 51 per cent, the racial and ethnic composition 
of the system's teacher population as a whole for each level and 
each category. The U.S. Court of Appeals upheld Judge Weinstein 1 s 
dismissal of a reverse discrimination suit challenging the plan 
brought by the United Federation of Teachers and others. 

Supreme Court decisions, beginning with Griggs v. Duke Power 
Co . (1971^ and Dothard v. Rawlinson (1977), have established the 
principle that discrimination exists where employer practices 
have an adverse effect on minority or female employment and are 
not justified by job necessity. Where there is a suspicion that 
this happening, a procedure may be challenged on the grounds that 
it violates Title VII of tp6 Civil Rights Act of 1964 as amended 
by the Equal Employment Opportunity Act of 1972. Title VII 
regulates a wider area of conduct than do older concepts of 
discrimination (Blumrosen, 1979) . 

The Title VII standard requires a greater emphasis on 
recruitment, consideration of all qualified candidates, arid a ( 
"closer fit" between selection procedures and the requirements 
of the job than does the 14th Amendment standard. Under the 
14th Amendment, adverse effect or impact is not sufficient to 



establish illegality • Rather, there must be some further showing 

V 

of improper or invidious motive or purpose in the implementation 
of the program (Blumrosen, 1979). * C 

On the measurement of adverse impact, there is additional 
guidance provided by the 1978 Uniform Guidelines on Employee 
Selection Procedures, issued by the U.S. Departments of Justice 
and Labor, the Equal Employment Opportunity Commission, and 
the Civil Service Commission in its so-called "80 per cent" 
or "4/5ths" rule. These federal agencies have stated that if 
the selection rate for minorities is - within 80 per cent of 

the selection rate for nonmiftorities, normally np adverse impact 

i 

will be fouhd. ThpLs is stated as .a "rule of thumb," not a ^rule 
of law" (BliJinrosen, 1979). 

/ 

Job-Relatedness and the NTE 

The realization of equality as well as quality is con- 
strained by the problem of defining what actually constitutes 
valid job-related selection procedures and criteria. This 
triad of problems — quality, equality and job-related selection 
procedures — is well-documented ii^ controversies over the NTE * 
The NTE consists of a common examination and/ 25 teaching area 
examinations, which test a candidate's general knowledge of 
subject matter and pedagogical principles. It provides a 
reasonable index of general literacy, as scores on these and 
other paper-and-pencil tests are significantly affected by 
verbal skill. However, as the 'Eighth Annual Mental Measure- 
ment Yearbook reports, the NTE has never been validated for 
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job ability. The NTE, therefore, has a questionable claim to 
validity in terms of predicting who will be an effective teacher 
(Buros, 1975). 

To be considered valid, an examination or rating guide 
should meet several conditions. One, it must provide consis- 
tent or reliable scores. Two, it must acti^ally measure the 
content it claims to measure. Three, the cofrtent must be 
directly tied to' the use of the examination. Implicit in 
these conditions is the criterion that no specific group of 
candidates can be systematically discriminated against. The 
publisher pnd user are responsible for slicing that an exami- 
nation has a reasonable degree of validity (New York State 
Education Department, 1977). 

As the NTE lacks validity in terms of ^edicting the 
actual job performance of beginning teachers, legal challenges 
have been raised. The U.S. Supreme Court reasoned in Washing - 
ton W» Davis (1976) that altHoi^h demonstrable evidence of 
job-r elatedness based on a thorough analysis of teaching is 
preferable, presumptive evidence of job-relate<Jness is accept- 

4 

abl<*. The Court has upheld the fundamental Requirement of the 
completion of the appropriate level of basic education, whether 

it is college or graduate education. The base line from which 

/ 



school systems can construct a selection system for teachers, 
it was argued, is one that starts wittv^persons who have achieved 



the basic education considered necessary to function as a 

V 

teacher. 

i 

The only major case, to justify a general selection proce- 
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dure by content validity was National Education Association v. 
South Carolina Earlier, in Georgia Association of 

Education v. Nix (1976), the use of the NTE was invalidated for 
use in teacher selection because it was considered to have an 
adverse impact on minority group candidates and not to be job- 
related. In the South Carolina case, the Federal District 
Court upheld the NTE as reflecting the content of teacher educa 
tion. No showing was made that the content of the examination 

jnatched the content of the jobs for which the candidates had 

! 

applied. The Supreme Court affirmed without opinion. There- 
fore, it is not possible to be Jsure of the grounds on which 
the affirmance rests (Blumroseh, 1979). 

\ 

State Certification % \ 

\ 

In -the United States, teacher selection is decentralized 
and, consequently, complex. While it \s\bureaucratic, it is 
not monolithic. It is fragmented and confuted by overlapping 
jurisdictions shared primarily by state governments, teacher 

educators and local school districts. As a result, it is diffi 

\ 

cult to establish any plear responsibility for the, selection 
of teachers (Mesierli, 1977). Although common practices may 
emerge in a national analysis, no single description ca*i be 
entirely accurate. Each state and school district sets i\s 



own selection procedures, requirements and standards, and th^se 
vary from state to state and from district to district. \ 

After the screening involved in obtaining entrance to 
ancl exit from a teacher education program, teacher candidates 
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are screened at the state level for certification. State certi- 
fication is a designation that usually mears an individual is 
a college graduate and has completed a course of study in edu- 
cation. Actually, it was not until th2 1940* s that many states 
required teachers to be college graduates. Prior to that, two- 
year training institutions, called normal schools, provided 

JF . . 

teacher training in the cases where specific preparation was 

required at all. 

Mostly, candidates for certification are required to have 
a baccalaureate degree that includes 24 semester hours in pro- 
fessional education and a college-supervised student teaching 
experience. In some c«ses, state departments of education ac- 
cept courses completed in ano ;her degree program in lieu of 
formal teacher preparation. In addition to college degree and 
course requirements,, many states require the NTE or some other 
written, standardized test for certification. 

Local Eligibility^ Requirements 

State certification is no guarantee of emploiment. Teacher 
candidates must also satisfy an array of requirements set at 
the local school district level. For applicants, this often 
involves a review of state certification status, college tran- 
scripts, references , employment record, N?E scores, proof of 
citizenship and an interview. In addition, some districts re- 
quire a medical examination, fingerprinting, locally constructed 
tests, personality inventories and an observation in a teaching 
situation. 



In 1970, 1,100 school districts, in addition to four 
states, required the NTE. As a result, many prospective 
teachers take the NTE as a matter of course. In 1978-1979, 
for instance, 55,000 prospective teachers took the NTE. No 
passing scor,es are set by the NTE. The decision as to how 
the scores are used, whether cut-off scores are set and at 
what level, is lefc entirely to those who require the test. 

In 19 78, a detailed questionnaire orf teacher selection 
policies ahd practices wa? sent to school superintendents in 
62 large cities across the country *>ith populations of 220,000 
or more. Fifty-four of che 62 school systems responded. One- 
third of the respondents, including all of tho nation's five 
largest cities — New York, Chicago, Los Angeles, Philadelphia 
and Detroit — reported than they usea teacher selection exami- 
nations. Among chose using examinations, the oral or inter- 
view was used most often (N-13). This was followed by the 
NTE (N=ll), medical examinations (N=ll), and training evalua- 
tions (N=9). Locally prepared essay examinations were ased 
by 16.7 per cent of those using examinations, and locally 
prepared multiple-choice tests were used by 13 per cen 
(New York City Board of Examiners, 1978). 

Two-thirds of the systems that used examinations construe 
ted eligibility lists. Approximately 18.5 per cent indicated 
that the lists were rank ordered from highest to lowest test 
scores. Approximately 26 per cent reported that applicants 
were given assignment preferences on the basis of rank order. 
An estimated 11 per cent said they used other methods. Selec- 



tion and placement in these cases were based upon such criteria 
as level of basic skills in teaching, certification, a combined 
evaluation of experience, references ana college transcripts, 
and affirmation action goals. Among the systems reporting 
they did not use examinations, the interview was cited as the 
most frequently used selection procedure* These respondents 
did not consider the interview an examination (New YorK City 
Board of Examiners, 1978)* 

Competency-Based Teacher Education and Certification 

Because of the perceived failure of existing teacher 

v 

education and certification procedures, comp^ency-based 
education and certification programs are being mandated by 
stat'S-Megisla tures and departments of education* These states 
require that in place of transcript and/or program approval, 
proof be submitted that teacher applicants possess a set of 
competencies validated in terms of student growth. Oregon, 
Washington," California, Minnesota, Michigan, Pennsylvania, 
Mew Jersey, Texas and Florida were in the forefront of fnis 
movement (New York City Commission on Human Rights, 19 7 i;. 

Some states ha^e gone further in mandating competency-based 
teacher education* Bor example, Oklahoma passed legislation, 
effective in 1982, that requires tightened admission standards, 
more field-based training, a fifth year for a teaching intern- * 
ship and examinations for future teachers in subjects they will 
teach. The legislature also requires deans and faculty members 
in education programs to demonstrate every five years that they 
are up-to-date in their fields (Watkms, 1981^, 
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More and more, state legislatures are requiring teachers 

to demonstrate by means of licensing examinations that they 

know the subject matter they will teach and understand hov 

to manage a classroom before they are certified to teach. In 
*• 

1980, fourteen states required competency testing, including 
Arkansas, Florida, Louisiana, Mississippi, North Carolina, 
South Carolina, Virginia and Wisconsin, and bills for similar 
requirements are under consideration in many other states 
(Watkins, 1981; Task Force on Teacher Education and Certifi- 
cation, 1979 ). 

State efforts to tighten professional standards are 
receiving support within the teaching profession itself. 
In 1979, NCATE, which accredits 550 of the 1,350 institutions 
with teacher training programs, denied approval to more than 
30 per cent of the programs it evaluated, A few years earlier, 
approval had been denied to fewer than 10 per cent {Watkins, 
1981) . Also, the United Federation of Teachers (UFT) has 
undertaken a campaign to support the adoption of entry tests 
for new teachers . Albert Shanker has stated, "We require 
physicians, attorneys and others to pass tests before they 
are licensed. It is time we did it for teachers" (Shanker, 1980). 

Competency Testing as an Issue 

Many, including the National Education Association, con- 
sider state-wide testing a simplistic answer to a complex prob- 
lem. They argue that the NTE and other written proficiency 
tests give too UttLe attention to effective measures of per- 
formance and are based on the questionable assumption that 
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knowledge can be converted into appropriate behavior. Medley 
(1967) has concluded that any teacher effectiveness criterion 
that can be predicted with a selection test is probably irrele- 
vant to teacher competence. To use a comparison Herbert Cans 
applied within a different context, the critics seem to be 
saying, we have a "meritocracy of credentialling, " but not a 
"meri tocracy of perf ormance 11 (Messerli, 1977 ) . 

Critics are also troubled by the cultural bias in the 
tests and their exclusionary effects on minority and disadvan- 
taged populations. In this regard, Educational Testing Service 
(ETS) research psychologist Thelma Spencer reported that evidence 
to support the allegation of cultural bias in the NTE can be 
observed in the significant differences in the mean scores of 
different racial groups and in the different response patterns 
to certain types of test items found in NTE scores in North and 
South Carolina (Kiersch, 1979). 

Defenders respond that blaming bias against minorities 
on tests is like blaming a thermometer for a fever (Shanker, 
1980). Farther, they argue that special problems of corrup- 
tion and favortism would result if the Importance of testing 
were lessened. Race and ethnic identity, they predict, would 
become the overriding factors in selection, and reverse dis- 
crimination would replace the merit system (Career test 
1979). There is little evidence at the moment, however, that 
patronage is the rule or that teehers are demonstrably inferior 
in school systems where written tests are not used (Tracten- 
■berg, 1973). 
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In addition to protecting, school systems from extraneous 
influences, defenders of testing believe it functions as an 
effective method for screening out incompetents. It is con- 
ceivable, they argue, that a teacher could have a grasp of the 
knowledge component of teaching and not have .the ability to 
apply this knowledge in the classroom. However, it is impos- 
sible for them to conceive the opposite — a teacher who lacks 
the essential knowledge, but who s^ll manages to teach effec- 
tively (New York State Education Department, 1977). 

The complexity of the testing issu<=>, as it relates to 
teacher quality and equality and job-rexatedness , is well- 
illustrated in the circumstances surrounding Chance v. Boar^j 
of Examiners (1971, 1972). In this case, supervisory tests 
for the New York City public schools were challenged on the 
grounds that they were discriminatory and were not job- related. 
The result was an almost ten-year injunction against written 
supervisory examinations. Hundreds of thousands of dollars 
were spent negotiating, litigating and gaining-axpert assis- 
tance on the development of new tests. Along the way, new 
procedures, developed at great cost, were dropped because . 
they seemed too costly to implement, were potentially discrim- 
inatory and were not adequately job-related (Public Education 
Association, 1981). / 

In 1980, a new supervisory test, .developed by recognized 
experts, was administered to junior high school principal 
candidates. While 46 per cent .or more of the white applicants 
passed, no more than 17 per cent of the Hispanic and 11 per cent 
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of the black applicants passed* As a result, some experts be-' 
lieve that despite attempts to follow widely accepted validation 
procedures, the format, content and scoring of the test, and the 
fact that it is not based on an updated job definition, made it 
an invalid instrument. These experiences have convinced spme, 
including the Public Education Association, that the development 
of a valid state-wide competency test for teachers is difficult, 
if not impossible, and prohibitively costly (Public Education 
Association, 1981)* 

The Cutting Edge of Reform 

Other areas on the cutting edge of reform, in addition to 
the state competency movement, are: (1) the requirement that 
prospective teachers complete a teaching-internship year after 
graduation as a condition for certification, and (2) the inten- 
sification of efforts by local communities to recruit teachers 
from minority backgrounds* 

In terms of the first, -teachers are required to complete 
an internship or probationary period where competence is assessed 
hy a master teacher, a principal and/or a state review panel* 
This procedure holds great promise for integrating realistic 
performance measures into the certification process* An impor- 
tant issue here is what criteria to use in evaluating interns. 
In New York State, the Regents have suggested that evaluation 
be linked to pupil performance. Commenting on this, a UFT 
spokesperson said it was unrealistic and unfair to view school 
or pupil improvement as* factors in evaluating interns as it 
ignores the role of conditions beyond the classroom which affect 
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student performance (On 'teaching . .., 1980). 

The debate highlights the many questions the competency- 
based teacher education movement has raised, which have yet 
to be answered and supported adequately. Among these aret 
Should teacher or pupil behavior be used as evidence of teacher 
competency? On what basis should specific behavioral indica- 
tors of competence be selected for observation? What Measures 

have enough scope to reflect the complexity of teacher-student 

\ 

\ 

interaction? 
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The second area is reflected in the demands by some commu- 
nity groups to employ more minority teachers. These "ttemands , 
they claim, are supported by the Lau v. Nichols (1974/ decision, 
which ruled that Chinese students must have Chinese-speaking 
teachers available to them. Programs have already developed 
in response to thesexdC^ands in East Palo Alto, the Rough Rock 
Demonstration School on the Navajo Reservation and in New York 
City. In East Palo Alto, the Ravenwood district instituted an 
extensive teacher training program to prepare community resi- 
dents ate paraprof essionals and teachers. The Rough Rock Demon- 
stration School, run by a parent board, has committed itself 
to t^ainin^ Navajo teachers. New York City inaugurated a pro- 
gram for paraprof essionals drawn from local communities to 
work on college degrees in the (city University (Gittell, 1974). 

Teachers are also demanding an enlarged role in the selec- 
tion process. Recommendations are surfacing which call for the 
creation of professional practice boards. These boards would 

provide a mechanism to establish rules and procedures for appeal 

\ 
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and reexamination similar to those in effect in the legal and 

medical professions. These and other recent reforms suggest 

that the former mutuality of interest, shared by state depart- 

ments of education, teacher educators and local school districts, 

is "breaking down and is boing augmented by community groups and - 

professional teacher organizations. \ 

r 

Presage, Process and Product Variables \ 

. 7 \ 
The second part of the review essay will move from a \ 

political analysis of how the issues of teacher quality and 

t equality for minority-group teacher applicants have influenced 

the development of teacher selection criteria and procedures 
• i 

over the last decade, to a review and analysis of what educa- 
tional researchists to tell us about the major components in 
the XV teache> Selection process. The research focuses primarily 
on the relationship between tforee categories of variaVj.es: 
presage, process and product (Mitzel, 1960). Presage v^ri- 
ables are defined as those characteristics that teachers possess 



befpre they enter the classroom. These include standardized 
test scores, grades, attitudes, personality, experience and 
race. Process factors are those that desdribe what actually 
occurs in the classroom between a teacher and student. These 
consist of teacher behavior, teacher-student verbal and non- 
verbal interactions, and various kinds of teaching strategies 
and situations. Both presage and process variables are often 
investigated in terms of product variables, such as student 
outcomes . 
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The utility of this research depends finally on the degree 
to which it resolves the problem of determining what teacher 
characteristics are correlates of improved student performance 
and growth. The issues of teacher quality and equality and 
appropriate job-related selection procedures are deeply related 
to the problem of defining effective teacher characteristics. 
Rosenshine and Furst (1971), after reviewing more than 50 arti- 
cals on teacher effectiveness, stated that researchers still 
have to conduct jptore carefull^* controlled experimental studies 
v and^ to define terms better before strong conclusions can be 
drawn about the correlates of teacher effectiveness. They in- 
dicated process-product studies will not improve until research- 
ers: (1) give more attention to the prior learning of students 
and to the control of the effects of differences in pretest 
scores of ^students taught by different teachers; (2) examine 
teacher behavior over, several situations instead of just one 
(e.g., during lectured, independent study, and over different 
periods throughout a single or several* days ) ; (3) collect 
evidence of teacher effectiveness from one year to the next; 

and (4) design observationsal studies of teacher behavior within 

\ 

short periods of time. 

In a more recent review of teacher effectiveness research, 
McNeill and Popham (1973) concluded that more agreement is re- 
quired on educational goals before the competencies oi" teachers 
can be assessed accurately. Researchers could eliminate this 
problem, the authors suggested, by asking teachers to select a 
set of objectives before their effectiveness is evaluated. Then 



the relationship between presage and process variables could be 
studied in terms of which eachers reached their "contracted" 1 
goals and which did not. Rosenshine and Furst's and McNeill 
and Popham's findings highlight the very real limitations of 
studies of teacher characteristics commonly stressed in current 
selection procedures. 

What follows is a review of studies that pertain to the 
relationships between teacher presage variables mentioned most 
frequently as important in the selection process, and student 
outcomes cited as most desirable, such as improved verbal and 
math achievement and grade point average (GPA ) . The teacher 
presage variables are: (1) NTE scores; (2) college grades 
and GPA; (3) years of experiences and certification; (4) per- 
sonality traits; and (5) race. Although personality assess- 
ments were mentioned less often than other variables, they were 
included because recent f indings 'suggest that they have some 
promise as a selection procedure. 

The NTE 

The common examinations of the NTE provide scores in 
general and professional education and a weighted combination 
score of the two areas (WCET), based on scores earned by college 
seniors who have taken the examinations since 1940. However, 
neither general nor professional education, nor any of their 
subtest scores (psychological foundations of education, teaching 
principles and practices, social studies, literature and fine 
arts, and science and mathematics) have ever been equated with 
each other or from form to form. Therefore, it is not known 
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whether any particular form or subtest is more difficult than 
any other. 

Studies of the NTE fall into two categories: (1) those that 
examine pre-service preparation of teacher education students? 
and (2) those that examine teachers-in-service. The pre-service 
preparation studies deal with NTE correlates with undergraduate 
GPA, success in student teaching and graduate school scores, and 
other standardized test scores, such as the Graduate Record 
Examinations (GRE) (Duncan, 1971; Eissey, 1967; Elting, 1969; 
McCamey, 1958; Seagoe, 1949; Shea, 1955; Simpson, 1962; Thacker, 
1964; Walbery, 1967). The in-service studies deal with NTE 
correlates with supervisor ratings of personal characteristics 
of teachers, such as orderliness, attitude and skill in teaching, 
classroom observations and pupil residual gain scores (Carson, 
1969; Delaney, 1954; Eissey, 1967; Flanagan, 1941; Lewis, 1968; 
Lins, 1946; Medley & Hill, 1970; Ryans, 1951; Shea, 1955; Thacker, 
1964) . 

In general, the pre-service studies reported^ that WCET 
scores and undergraduate GPA corrections ranged from .23 to 
X .74, with a mean value of .55. This value indicates that WCET 

scores and student GPA correlates fairly closely. Furthermore, 
these findings can be generalized to a wide range of students 
since the WCET scores and GPA*s were taken from students of 
universities across the United States. However, WCET scores 
•were not found to be good predictors of actual teaching per- 
formance. For example, the correlations obtained between WCET 
scores and practice-teaching grades were between -.01 and .04. 
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On the other hand, these low correlations may be due to the ^ 
lack of reliability often found with supervisor ratings of 
teaching effectiveness. 

The in-service studies showed a rather low relationship 
between NTE scores and teacher effectiveness, although a few 
studies were encouraging. The correlations between WCET scores 
and ratings by supervisors and principals during the first year 
through the third year of teaching were about .10. The super- 
visor and principal ratings included evaluations of individual 
personality characteristics, such as personal efficiency, pro- 
fessional attitude, skill in teaching, cooperation and class- 
room management. Pupil ratings and NTE and WCET scores were 
also low, about .28. 

In 1978, Sheehan and Marcus studied 119 'teachers to exam- 
ine the relationship between NTE scores and pupil achievement. 
Close to 2,000 first-grade pupils in a large urban school dis- 
trict comprised the population investigated* Of the total 
number of teachers, 65.5 per cent were white and 35.5 per cent 
were black. Most of the teachers had bachelor degrees, and 
some had masters*. The range of years of teacher experience 
was great, from 1 to 40 years. Their WCET scores ranged from 
333 to 732, with a mean of 516. 

Sheehan and Marcus employed a step-wise regression of 
analysis. The results indicated that the WCET scores were 
significant predictors of student class-average gains in achieve- 
ment in the vocabulary and mathematics subtests of the Iowa Test 
of Basic Skills, when years of experience and college degree 
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were controlled. However, when the race of the teachers was 
included, WCET scores were no longer found to be significant * 
predictors of the students 1 achievement gains. Race seemed to 
absorb the relationship initially found between the WCET scores 
and student achievement. In that there may have been racial 
differences in the NTE performance of the teachers, the authors 
cbncluded that race and WCET scores may have been confounded. 

The NTE has shown significant correlations with other 
standardized tests. For example, correlations between WCET 
scores and GRE verbal scores were .77, and .54 with GRE ^quan- 
titative scores. They were .51 with the GRE advanced scores 
in education (Johnson, 1963). These findings suggest that the 
NTE and tests of verbal and mathematical abilities, as well as 
achievement tests in education, involve similar kinds of basic 
aptitudes . 

In a more recent study, Ayers and Quails (1979) investi- 
gated the concurrent and predictive validity of the WCET. The 
results obtained from pre-service sources, which represent con- 
current validity, indicated that the WCET was significantly 
correlated with GPA and the Teachers Area Examination (TAE) 
of the NTE and the American College Test (ACT). The ACT, how- 
ever, seemed to be a better predictor of WCET and TAE scores 
than GPA. This finding was consistent with the results re- 
ported by Ayers and Rohr (1974) in their study of the rela- 
tionship between several teacher variables to success in teache 
training programs. 
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The predictive validity of the NTE, as measured by in-service 
teacher effectiveness criteria, showed positive correlations 
between principal ratings of the subject competency of secondary 
school teachers and the WCET. These findings were consistent 
with those discovered by Brown (1974), where TAE scores and prin- 
cipal ratings of secondary school teachers were significantly 
correlated. However, in the Ayers and Quails' study no signi- 
ficant findings were discovered between principal ratings and 
the NTE scores of elementary school teachers, which is consistent 



High school and college course grades, GPA and student 
teaching and internship ratings are easy to obtain ard quantify. 
As a result, they are often used as selection criteria for pre- 
dicting teaching effectiveness. Ii; general, findings on the 
value of grades as a predictor of teaching effectiveness are 
rather discouraging. Course grades and GPA have showed low 
correlations with principal and supervisor ratings in a range 
of from .07 to .22 (Eissey, 1967; Walberg, 1967). In severax 
more recent studies on the relationship between GPA and teaching 
performance, as measured by supervisor ratings, pupil evaluations 
and student achievement gains, the correlations also were not 
significant (Stevens, 1979; Hensen, 1980). 

Correlations obtained between practice-teaching grades and 
principal and supervisor ratings were considerably higher. 
Practice-teaching grades and super\ sor ratings of personal 



with the findings reported in a review by Quirk, Witten an 




Weisberg (1972) of studies on the validity of the NTE . 
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characteristics showed a correlation of .22; with principal 
ratings, they were around ,17 (Walberg, 1967), In several 
studies that examined the relationship between college grades 
and student teaching, positive correlations were reported by 
Green^-fl977), Emanual,, Laramore and Sagan (1975), and Williams 
and Fox (1967), Finally, in another study by Twa &nd Greene 
(1980), in which the authors used several criteria for predicting 
grades in student teaching, college GPA in a student's major 
was one of the most significant predictors. 

Years of Experience and Certification 

LuPone (1961) conducted a study to determine whether 
provisionally certified elementary school teachc s in the first 
three years teaching vere as effective c:S permanently certified 
teachers. In terms of principal stings, major differences be- 
tween provisionally and permanently certified teachers emerged, 

. J 

favoring those permanently certified. Specifically, permanently 
certified teachers received better ratings in: (1) nhe ability 
to organize and plan effectively; (2) skill in translating 
subject matter into living experience; (3) proficiency in 
using materials effectively in classroom instruction; (4) under- 
standing and sympathetic attitudes toward children ; and (5) ade- 
quate utilization of remedial reading, speech, and art and music 
specialists, and he school nurse and psychologist in furthering 
their understanding of the child. Second and ' third-year teachers 
also showed greater ability and skill ii) emphasizing study habits. 

In another study, Hall (1964) discovered that fully certi- 
fied teachers in Florida were more effective in teaching language 



areas, such as paragraph meaning, word meaning and spelling, 
than teachers who had not completed prescribed education courses for 
certification. Teacher effectiveness w^s derived through measures 
of pupil gains in standard achievement tests in the areas men- 
tioned above. However, in arithmetic reasoning and computation 
test scores, no differences wer^fe found. ^ 

In a more recent study, Levin (1968) investigated the 
relationship of years of experience and verbal scores of teachers 
to student achievement* In addition, Levin did a cost-effective- 
ness analysis of the financial expense to schools of raising 
student achievement scores in verbal skills through the mani-^ 
pulation of years of experience and verbal scores of teachers 
on sfandardlzed tests* He found that one-point increases in 
teachers' verbal scopes produced higher student achievement 
increases than one-year increases in years of teaching experi- 
ence. He also found that it cost much more to raise student 
achievement levels by hiring more experienced teachers than it 
did to hire teachers with higher verbal scores* 

These findings are supported by Coleman's (1966) studies 
on school factors^ responsible for student achievement. Coleman 
found that of the teacher characteristics investigated, verbal 
ability showed the strongest relationship to student achieve- 
ment, followed by years of teaching experience* 

Race 

t 

Morgan (1976) conducted a study to examine race' as well as 



years of experience as contributing factors in the reading and 
mathematics achievement of black pupils in seyrpgated schools. 
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Over 614 second, third and fourth-grade pupils and 40 teachers 
from six schools in a city in the northeastern United States 
were investigated. The results indicated that years of experi- 
ence was related to a significant increase in gain scores in 
reading, but not in mathematics, Teachers' race had no signi- 
ficant impact on the students 1 gain scores in either mathematics 
or in reading. 

As mentioned earlier, the issue of equity in the hiring 
of minority-group teachers has received serious attention by 
government and school officials. It would seem, therefore, 
that race would draw the attention of educational researchers. 
However, this has not been the case to any significant degree. 
Among the few studies on the effects of race ard teacher effec- 
tiveness are Morgan (1976) and Sheehan and Marcus (1978), both 
discussed earlier, Summers and Wolfe (1975) and Outtz (1976). 

Summers and Wolfe (1975) found that whether or not a 
teacher was black or nonblack seemed unrelated to pupil achieve- 
ment growth, while years of experience was related. On the 
other hand, Sheehan and Marcus (1978) found that race was 
related to student achievement gains in word and number usage 
when years of experience was controlled. There is a possibility 
that the researchers confounded race and WCET scores, however, 
which would prevent any valid conclusions from being drawn 
regarding the "real" effects of racial factors on student achieve- 
mer. u . 

In another study, Outtz (1976) examined the role of racial 
oias in evaluations of black teacher performance by black and 

M 





white raters. Race was manipulated through the use of photo- 
graphs of hypothetical white and black college instructors. 
Attached were written evaluations of their teaching performance, 
judged as either "good," "average" or "poor." In general, it 
was found that if an instructor's initial performance was rated 
as either average or poor, racial factors were important. For 
example, when teaching performance was described as poor, white 
raters favored white over black instructors in terms of the 
quality of their teaching performance. When teaching perfor- 
mance was described as average, white raters favored black 
over white instructors in terms of whether they should be pro- 
moted or not. Black raters also favored average black instruc- 
tors in terms of promotability . 

Finally, Medley and Quirk (1974) examined the relationship 
between the cultural content of sets of items of the NTE and 
the performance on those items by blacks and whites. The 
study grew out of a major concern that racial bias might be 
operating in the test, which could reduce its validity and dis- 
criminate unfairly against black candidates for teaching positions. 

Two experimental forms of the NTE were developed for the 
purposes of this study. In each form, 60 items were constructed 
with an equal number of items representing "black culture," 
"modern culture" and "traditional culture." These items were 
included in place of the 65 "general" cultural items normally 
contained in the test, which presumably reflected traditional 
American content. 

The results showed that the performance of black candidates 
compared to white candidates on general culture items depended 

erJc o« 
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on whether the items referred to concerns and experiences of 
black Americans. The average black candidate compared to the 
average white in this study would score 4.4 points higher on a 
test of 65 black cultural items and 13. points lower on a test 
of 65 traditional items.- A changeover from 65 traditional to 
65 black items would increase the scores of blacks relative to 
whites by 17.8 points, or over 27 per cent. Secondly, on a 
test made up of 65 modern items, blacks would score approximately 
5 points lower than whites. These findings were consistent 
whether candidates were male or female or were tested in the 
North or the South, or in an urban or rural part of the country. 

Personality Traits 

• Research on the impact of personality on teacher effective- 
ness is discouraging. In part, this may be due to' the low x 
correlations often found between measures of /personality and 
overt behavior (Mischel, 1968), or to the lack of clarity of 
the notion of teacher effectiveness. 

Methods used to assess personality includes personality 

questionnaires and inventories k projective tests, such as the 

• • - 

Rorschach Inkblot Test and the Thematic Apperception Test (TAT); 
observational ratings of teacher classroom behavior; and un- 
structured interviews. The interview technique is of particular 
importance in any discussion of teacher selection because of its 
widespread use by school administrators in the hiring of new 
teachers (New York City Board of Examiners, 1966; 1978). As 
an instrument for assessing the potential effectiveness of 
teacher candidates, however, it has been found to be rather 
biased . 
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Merritt (1971) found results suggesting that selection 
officers often place heavy emphasis on whether or not the 
attitudes of teacher applicants are congruent with their own 
in determining whether to accept or reject candidates. Cre- 
dentials, such as NTE scores, GPA and recommendations, receive 
little weight in these decisions. Several other studies have 
documented that interviewees frequently mislead interviewers 

into making false judgments about their character and their 

\ 

actual credentials (Maier, 1966; Maier & Thurber, 1968). 

Even though, overall, the findings regarding the impact 
of personality on teacher competence are discouraging, there 
have >een some significant results that warrant consideration. 
For example, Flanagan (1961) compared Minnesota Multiphasic 
Personality Inventory (MMPI) scores of 147 female teachers 
with supervisory ratings of teacher effectiveness and found 
that those coded high in hysteria were correlated to teacher 
competence; Bowers and Soar (1962) also reached the same 
conclusion. In their study, teacher effectiveness was evalu- 
ated through the revised Observation Schedule and Record (OSCAR) 
of Medley and Mitzel (1963) and the Russell Sage Social Relations 
Test, a measure of skill in cooperative planning and work. 
Bowers and Soar, however, also found that psychopathic deviate 
codings — schizophrenia and psychoathenia — were negatively 
correlated with teacher effectiveness. In general, these 
findings suggest that the more serious the diagnostic evalua- 
tion, the less likely a teacher will be able to form construc- 
tive relationships with students . 
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Burkhard (1962) also examined various teacher personality 
traits obtained by the TAT and found that teachers who ranked 
high in the ability to explain and recognize their own limita- 
tions and strengths, were liked more by their students than 
teachers ranked low in these personality traits. In another 
study designed to determine whether certain personality traits 
were associated with student ratings of teacher effectiveness, 
Goodwin (1979) found that teachers rated "reserved" and "tough- 
minded" were better motivators and more subject oriented than 
"tender-minded" teachers. Students also perceived the more 
intelligent teachers to be fairer than less intelligent teachers. 
Estimates of intelligence and fairness, however, were both 
determined by the students, which limits any conclusions re- 
garding whether intelligence and fairness are related. 

In a more recent study, Saracho and Dayton (1980) investi- 
gated the relationship of teachers 1 cognitive style and pupil 
academic achievement residual gain scores. The Embedded Figures 
Test (EFT), which yields measures of field dependent and indepen- 
dent cognitive styles, was administered to 36 teachers and 432 
second and third-grade pupils. Teachers and children with 
similar cognitive styles were considered mismatched. Also, 
comprehensive tests of basic skills were administered twice 
during a one-year period to determine how much the children 
gained in 4 achievement that year. 

The results indicated that children with field independent 
teachers showed greater gain scores in achievement than children 
with field dependent teachers. There were no significant finding 
on student gain scores based on matched or mismatched cognitive 
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styles or within grade levels • These findings suggest that the 
less influenced a teacher is by social standards or public 
opinion and the more articulate and conceptual the cognitive 
style, the more likely higher gains in student achievement will 
be induced. 

Ayers and Quails (1979) found significant correlations 
between the NTE and the California F-Scale, which measures the 
degree to which a person is flexible and open-minded. The 
authors found that candidates who scored high on the WCET and 
TAE were more open-minded and less authoritarian than candidates 
who scored low on these tests. Althdugh these correlations do 
not test directly the use of the F-3cale as a predictor of 
teacher competence, it does offer indirect evidence that stu- 
dent-teachers who do well in training are more open-minded than 
those/ who do poorly. 

Ayers and Quails also found that the WCET was correlated 
with friendliness and cheerfulness as derived from student 
evaluations. No significant correlations, however, were dis- 
covered on personality dimensions of creativity, dynamism, 
organized demeanor or warmth and acceptance. ' 

Fox and Peck (1978) investigated the relationship between 
several personality traits, teacher behavior and indices of 
teacher effectiveness. In general, the authors found that 
high scores on efficiency based on teacher self-reports were 
related to pupil self-esteem. Teachers who described them- 
selves as introverted were the best predictors of pupil achieve- 
ment gains and positive self-evaluations. The authors also 
mentioned that the results seemed to suggest that teachers who 
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rated themselves as^ charming^ witty and sophisticated (extroverted 
measure) had negative effects dn pupil achievement and self-reports 
of their ability to cope with life's problems* 

Conclusion 

The research findings on presage variables and teacher 
effectiveness fluctuated on the basis of the type of presage 
variable studied and the product variables used as measures of 
teacher competence. Most of the studies reviewed employed pupil 
residual gains in achievement and principal and other administra- 
tor ratings as measures of teacher effectiveness. These product 
variables often showed low concurrent validity with each other 
and low correlations with the presage variables studied. 

Verbal ability showed the strongest correlations with 
measures of teacher competency based on principal and other 
supervisory ratings and pupil residual achievement gain scores, 
NTE scores showed only moderate, but significant correlations 
with these product variables. Years of experience and certifi- 
cation, although lower than verbal ability, showed rather posi- 
tive associations with teacher effectiveness as measured by 
principal ratings and pupil verbal achievement score gains. 
Math achievement gains did not show a positive correlation with 
years of experiences and certification* 

Course grades showed discouraging correlations with measures 
of teacher effectiveness* The correlations obtained were between 
•07 and .22. Practice teaching, at .17, showed up on the higher 
end of the scale. This might have been due to the similarities 
of the situations in which the ratings were derived. 
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Race registered one of the poorest correlations with teacher 
effectiveness. However, the value of this finding is greatly 
limited by the paucity of studies on race and the methodological 
problems in the studies. For example, it was reported in some 
studies that supervisor ratings of teacher competence were in- 
fluenced by racial factors. Also, it was claimed that standard- 
ized tests of achievement, such as the NTE, are culturally biased 
against minority candidates, 

A number of personality traits have shown a potential for 
a relationship to teachar effectiveness. These include field 
dependent cognitive style, open-mindedness, level of warmth 
and sincerity, and introversion. However, the correlations 
found tended to be rather low and to vary from study to study. 

In conclusion, the importance of the presage variables re- 
viewed in relationship to their value as indicators of teacher 
competence can be listed from highest to lowest in the following 
order* (1) verbal ability; (2) year s^ of experience and certifi- 
cation; (3) NTE scores; (4) grades; and (5) race. As for per- 
sonality* the data now available is not adequate to rank it ^ 
with the other variables. 

It must be noted, however, that two of the procedures 
frequently used to measure") teacher competence — interviews and 
principal ratings — often are biased. As a result, serious ques- 
tions can be raised about the conclusions drawn regarding the 
value of each of the presage variables reviewed. Research 
findings suggest that interviews often yield unreliable infor- 
mation about teacher credentials and personality traits. 
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Ratings of teacher competence, obtained through principals and 
other supervisory staff, often fall within the upper ranges of 
possible scores^. This is attributed to administrative efforts 
to project a favorable impression of teacher effectiveness. 
Therefore, it is possible that the low correlations may be 
due to the low discriminability of the product variables ana- 
lyzed. 

f 
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